72 research outputs found

    Hierarchical multi-stream posterior based speech secognition system

    Get PDF
    Abstract. In this paper, we present initial results towards boosting posterior based speech recognition systems by estimating more informative posteriors using multiple streams of features and taking into account acoustic context (e.g., as available in the whole utterance), as well as possible prior information (such as topological constraints). These posteriors are estimated based on “state gamma posterior ” definition (typically used in standard HMMs training) extended to the case of multi-stream HMMs.This approach provides a new, principled, theoretical framework for hierarchical estimation/use of posteriors, multi-stream feature combination, and integrating appropriate context and prior knowledge in posterior estimates. In the present work, we used the resulting gamma posteriors as features for a standard HMM/GMM layer. On the OGI Digits database and on a reduced vocabulary version (1000 words) of the DARPA Conversational Telephone Speech-to-text (CTS) task, this resulted in significant performance improvement, compared to the stateof-the-art Tandem systems.

    Multilingual representations for low resource speech recognition and keyword search

    Get PDF
    © 2015 IEEE. This paper examines the impact of multilingual (ML) acoustic representations on Automatic Speech Recognition (ASR) and keyword search (KWS) for low resource languages in the context of the OpenKWS15 evaluation of the IARPA Babel program. The task is to develop Swahili ASR and KWS systems within two weeks using as little as 3 hours of transcribed data. Multilingual acoustic representations proved to be crucial for building these systems under strict time constraints. The paper discusses several key insights on how these representations are derived and used. First, we present a data sampling strategy that can speed up the training of multilingual representations without appreciable loss in ASR performance. Second, we show that fusion of diverse multilingual representations developed at different LORELEI sites yields substantial ASR and KWS gains. Speaker adaptation and data augmentation of these representations improves both ASR and KWS performance (up to 8.7% relative). Third, incorporating un-transcribed data through semi-supervised learning, improves WER and KWS performance. Finally, we show that these multilingual representations significantly improve ASR and KWS performance (relative 9% for WER and 5% for MTWV) even when forty hours of transcribed audio in the target language is available. Multilingual representations significantly contributed to the LORELEI KWS systems winning the OpenKWS15 evaluation

    The Simons Observatory: Cryogenic Half Wave Plate Rotation Mechanism for the Small Aperture Telescopes

    Full text link
    We present the requirements, design and evaluation of the cryogenic continuously rotating half-wave plate (CHWP) for the Simons Observatory (SO). SO is a cosmic microwave background (CMB) polarization experiment at Parque Astron\'{o}mico Atacama in northern Chile that covers a wide range of angular scales using both small (0.42 m) and large (6 m) aperture telescopes. In particular, the small aperture telescopes (SATs) focus on large angular scales for primordial B-mode polarization. To this end, the SATs employ a CHWP to modulate the polarization of the incident light at 8~Hz, suppressing atmospheric 1/f1/f noise and mitigating systematic uncertainties that would otherwise arise due to the differential response of detectors sensitive to orthogonal polarizations. The CHWP consists of a 505 mm diameter achromatic sapphire HWP and a cryogenic rotation mechanism, both of which are cooled down to ∌\sim50 K to reduce detector thermal loading. Under normal operation the HWP is suspended by a superconducting magnetic bearing and rotates with a constant 2 Hz frequency, controlled by an electromagnetic synchronous motor. The rotation angle is detected through an angular encoder with a noise level of 0.07ÎŒrads\mu\mathrm{rad}\sqrt{\mathrm{s}}. During a cooldown, the rotor is held in place by a grip-and-release mechanism that serves as both an alignment device and a thermal path. In this paper we provide an overview of the SO SAT CHWP: its requirements, hardware design, and laboratory performance.Comment: 19 pages, 21 figures, submitted to RS

    American Society of Clinical Oncology/College ofAmerican Pathologists guideline recommendations forimmunohistochemical testing of estrogen andprogesterone receptors in breast cancer

    Get PDF
    Purpose: To develop a guideline to improve theaccuracy of immunohistochemical (IHC) estrogen receptor(ER) and progesterone receptor (PgR) testing in breastcancer and the utility of these receptors as predictivemarkers.Methods: The American Society of Clinical Oncologyand the College of American Pathologists convened aninternational Expert Panel that conducted a systematicreview and evaluation of the literature in partnership withCancer Care Ontario and developed recommendations foroptimal IHC ER/PgR testing performance.Results: Up to 20% of current IHC determinations ofER and PgR testing worldwide may be inaccurate (falsenegative or false positive). Most of the issues with testinghave occurred because of variation in preanalyticvariables, thresholds for positivity, and interpretationcriteria.Recommendations: The Panel recommends that ER andPgR status be determined on all invasive breast cancers andbreast cancer recurrences. A testing algorithm that relieson accurate, reproducible assay performance is proposed.Elements to reliably reduce assay variation are specified. It is recommended that ER and PgR assays be consideredpositive if there are at least 1% positive tumor nuclei in the sample on testing in the presence of expected reactivity of internal (normal epithelial elements) and external controls. The absence of benefit from endocrine therapy for women with ER-negative invasive breast cancers has been confirmed in large overviews of randomized clinical trials.(Arch Pathol Lab Med. 2010;134:907–922

    Clinical presentation and predictors of outcome in patients with severe acute exacerbation of chronic obstructive pulmonary disease requiring admission to intensive care unit

    Get PDF
    BACKGROUND: Severe acute exacerbation of chronic obstructive pulmonary disease (AE-COPD) is a common reason for emergency room (ER) visit about which little has been documented from India. METHODS: Prospective study of the clinical presentation and predictors of outcome in 116 patients presenting with severe AE-COPD requiring admission to the medical intensive care unit between January 2000 and December 2004. RESULTS: Their mean age was 62.1 ± 9.8 years. There were 102 males. Mean duration of COPD was 7.2 ± 5.8 years. All males were smokers (22.3 ± 11.2 pack years); 35.2% smoked cigarettes and 64.8% smoked bidis. All women were exposed to domestic fuel. Associated co-morbid illnesses were present in 81 patients (69.8%); 53(45.7%) had one co-morbid illness and the remaining 28 (54.3%) had two or more co-morbid illnesses. Evidence of past pulmonary tuberculosis (PTB) was present in 28.4% patients; 5 patients who also had type II diabetes mellitus had active PTB. Arterial blood gas analysis revealed respiratory failure in 40 (33.8%) patients (type I 17.5% and type II 82.5%). Invasive mechanical ventilation was required in 18 patients. Sixteen (13.7%) patients died. Stepwise multivariate logistic regression analysis revealed need for invasive ventilation (OR 45.809, 95%CI 607.46 to 3.009;p < 0.001); presence of co-morbid illness (OR 0.126, 95%CI 0.428 to 0.037;p < 0.01) and hypercapnia (OR 0.114, 95%CI 1.324 to 0.010;p < 0.05) were predictors of death. CONCLUSION: Co-morbid conditions and metabolic abnormalities render the diagnosis of AE-COPD difficult and also contribute to mortality. High prevalence of past PTB and active PTB in patients with AE-COPD suggests an intriguing relationship between smoking, PTB and COPD which merits further study

    Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

    Get PDF
    The electronic version of this article is the complete one and can be found online at: http://dx.doi.org/10.1186/s13636-015-0063-8Spoken term detection (STD) aims at retrieving data from a speech repository given a textual representation of the search term. Nowadays, it is receiving much interest due to the large volume of multimedia information. STD differs from automatic speech recognition (ASR) in that ASR is interested in all the terms/words that appear in the speech data, whereas STD focuses on a selected list of search terms that must be detected within the speech data. This paper presents the systems submitted to the STD ALBAYZIN 2014 evaluation, held as a part of the ALBAYZIN 2014 evaluation campaign within the context of the IberSPEECH 2014 conference. This is the first STD evaluation that deals with Spanish language. The evaluation consists of retrieving the speech files that contain the search terms, indicating their start and end times within the appropriate speech file, along with a score value that reflects the confidence given to the detection of the search term. The evaluation is conducted on a Spanish spontaneous speech database, which comprises a set of talks from workshops and amounts to about 7 h of speech. We present the database, the evaluation metrics, the systems submitted to the evaluation, the results, and a detailed discussion. Four different research groups took part in the evaluation. Evaluation results show reasonable performance for moderate out-of-vocabulary term rate. This paper compares the systems submitted to the evaluation and makes a deep analysis based on some search term properties (term length, in-vocabulary/out-of-vocabulary terms, single-word/multi-word terms, and in-language/foreign terms).This work has been partly supported by project CMC-V2 (TEC2012-37585-C02-01) from the Spanish Ministry of Economy and Competitiveness. This research was also funded by the European Regional Development Fund, the Galician Regional Government (GRC2014/024, “Consolidation of Research Units: AtlantTIC Project” CN2012/160)

    Results of gravitational lensing and primordial gravitational waves from the POLARBEAR experiment

    Get PDF
    POLARBEAR is a Cosmic Microwave Background radiation (CMB) polarization experiment that is located in the Atacama Desert in Chile. The scientific goals of the experiment are to characterize the B-mode signal from gravitational lensing, as well as to search for B-mode signals created by primordial gravitational waves (PGWs). Polarbear started observations in 2012 and has published a series of results. These include the first measurement of a nonzero B-mode angular auto-power spectrum at sub-degree scales where the dominant signal is gravitational lensing of the CMB. In addition, we have achieved the first measurement of crosscorrelation between the lensing potential, which was reconstructed from the CMB polarization data alone by Polarbear, and the cosmic shear field from galaxy shapes by the Subaru Hyper Suprime-Cam (HSC) survey. In 2014, we installed a continuously rotating half-wave plate (CRHWP) at the focus of the primary mirror to search for PGWs and demonstrated the control of low-frequency noise. We have found that the low-frequency B-mode power in the combined dataset with the Planck high-frequency maps is consistent with Galactic dust foreground, thus placing an upper limit on the tensor-to-scalar ratio of r < 0.90 at the 95% confidence level after marginalizing over the foregrounds

    Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

    No full text

    LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign

    No full text
    • 

    corecore